Complexity and correctness of a super-pipelined processor

Author

Jochen Preiß

Abstract

This thesis introduces the DLXπ+, a super-pipelined processor with variable cycle time. The cycle time of the DLXπ+ may be as low as 9 gate delays (including 5 gate delays for registers), which is assumed to be a lower bound for the cycle time. Correctness proofs are provided for the parts of the DLXπ+ that differ significantly from previous implementations. Formulas are developed that compute restrictions on the parameters of the DLXπ+, e.g., the maximum number of reservation station entries for a given cycle time. The formulas also compute which modifications to the base design are needed to realize a certain cycle time and how these affect the number of pipeline stages. This lays the foundation for future work: computing the time per instruction of the DLXπ+ for a given benchmark at different cycle times in order to determine the “optimum” cycle time.

Similar resources

Machine with Out-of-Order Instruction Completion

1 Project Goal and Overview. Our goal in this project is showing that the verification of complex pipelined machines is possible. As we discussed in the previous sections, the micro-architectural designs of general purpose microprocessors have not been thoroughly studied as a target of formal verification. It is not clear even how to represent the correctness of the super-scalar super-pipelined ma...

Efficient implementation of low time complexity and pipelined bit-parallel polynomial basis multiplier over binary finite fields

This paper presents two efficient implementations of fast and pipelined bit-parallel polynomial basis multipliers over GF(2^m) by irreducible pentanomials and trinomials. The architecture of the first multiplier is based on a parallel and independent computation of powers of the polynomial variable. In the second structure only even powers of the polynomial variable are used. The par...

Verification of in-order execution in pipelined processors

As embedded systems continue to face increasingly higher performance requirements, deeply pipelined processor architectures are being employed to meet desired system performance. System architects critically need modeling techniques that allow exploration, evaluation, customization and validation of different processor pipeline configurations, tuned for a specific application domain. We propose...

Decomposing the Proof of Correctness of pipelined Microprocessors

We present a systematic approach to decompose and incrementally build the proof of correctness of pipelined microprocessors. The central idea is to construct the abstraction function using completion functions, one per unfinished instruction, each of which specifies the effect on the observables of completing the instruction. In addition to avoiding term size and case explosion as could happen for dee...

Design and Implementation of Digital Demodulator for Frequency Modulated CW Radar (RESEARCH NOTE)

Radar Signal Processing has been an interesting area of research for realization of programmable digital signal processor using VLSI design techniques. Digital Signal Processing (DSP) algorithms have been an integral design methodology for implementation of high speed application specific real-time systems especially for high resolution radar. CORDIC algorithm, in recent times, is turned out to...

Publication date: 2005
